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ALTERING NETWORK TRANSMITTED CONTENT DATA 
BASED UPON USER SPECIFIED CHARACTERISTICS 



FIELD OF THE INVENTION 
5 The present invention relates generally to computer networks, and more 

specifically, to a system for transforming data transmitted over a network through 
characteristics specified by a user. 



BACKGROUND OF THE INVENTION 

10 

The basic functions of a computer network are to transmit, exchange or store data 
transmitted among computers coupled to the network. Most network implementations 
use a computer network simply as a point-to-point system to route and channel 
information among the networked computers. Some processes, such as compression or 

15 encryption techniques that speed transmission rates or enhance transmission security may 
be implemented on the transmitted data. In general, however, relatively little processing 
is performed on most data once it is transmitted fi-om the sending terminal. Data is 
typically processed at the sending terminal and transmitted to the receiving terminal in its 
processed form. Standard network transmission systems therefore do not provide 

20 flexibility or opportunity for a receiver or third party to transform or process the data 
according to the receiving party's needs. 

Present communication systems also typically do not provide effective 
mechanisms in which the relative location of various users is reflected in the audio output 
of characters representing the users in a networked game or other application. 
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What is needed, therefore, is a system that allows transmitted content data to be 
processed or transformed according to a receiver's needs after it has been generated and 
transmitted by a sending terminal. 



SUMMARY AND OBJECTS OF THE INVENTION 

It is an object of embodiments of the present invention to provide an improved 
content data output system that enhances interactive computer applications, such as 
networked video games and chat appUcations. 
5 It is a further object of embodiments of the present invention to provide audio 

output that reflects the relative physical location of users as they are distributed in the 
network. 

A system for converting content data transmitted over a computer network from a 
first computer to a second computer is disclosed. Content data comprising text or audio 

10 data is input into the first computer. The content data is digitized to produce digitized 
content data. If the content data comprises audio data, the data is digitized through a 
speech to text process. Parameters controlling the modification of the digitized content 
data are received from a user of a second computer. The parameters are input into a 
graphical user interface provided for the user and dictate output voice characteristics such 

15 as, gender, expression, accent, and language. The digitized content data is altered in 
accordance v^th the content data output characteristics specified by the user, and then 
provided to the second computer for output as modified voice data. In one embodiment, 
the relative location information of the users in a network is determined. This location 
information is used to further modify the audio output from characters representing the 

20 users. 

Other objects, features, and advantages of the present invention will be apparent 
from the accompanying drawings and from the detailed description that follows below. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example and not limitation in the 
figures of the accompanying drawings, in which like references indicate similar elements, 
and in which: 

5 Figure 1 illustrates a block diagram of a computer network system that 

implements embodiments of the present invention; 

Figure 2 illustrates a block diagram of a network that includes a content data 
conversion process for text data, according to an embodiment of the present invention; 
Figure 3 illustrates a block diagram of a network that includes a content data 
10 conversion process for voice data, according to an embodiment of the present invention; 

Figure 4 is a flow diagram illustrating the processing of data through the voice 
conversion process illustrated in Figure 3, according to one embodiment of the present 
invention; 

Figure 5 illustrates a character profile setup input screen displayed in a graphical 
15 user interface system, according to one embodiment of the present invention; 

Figure 6 illustrates a networked game environment in which user game consoles 
communicate over a network, according to one embodiment of the present invention; 
and 

Figure 7 illustrates a networked game environment in which user game consoles 
20 communicate over a network, according to an alternative embodiment of the present 
invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

A data conversion system for processing downloaded content over a computer 
netvi^ork is described. In the following description, for purposes of explanation, 
numerous specific details are set forth in order to provide a thorough understanding of the 

5 present invention. It will be evident, however, to one of ordinary skill in the art, that the 
present invention may be practiced v^thout these specific details. In other instances, 
well-known structures and devices are shown in block diagram form to facilitate 
explanation. The description of preferred embodiments is not intended to limit the scope 
of the claims appended hereto, 

10 Aspects of the present invention may be implemented on one or more computers 

executing software instructions. According to one embodiment of the present invention, 
server and client computer systems transmit and receive data over a computer network or 
standard telephone line. The steps of accessing, downloading, and manipulating the data, 
as well as other aspects of the present invention are implemented by central processing 

15 units (CPU) in the server and client computers executing sequences of instructions stored 
in a memory. The memory may be a random access memory (RAM), read-only memory 
(ROM), a persistent store, such as a mass storage device, or any combination of these 
devices. Execution of the sequences of instructions causes the CPU to perform steps 
according to embodiments of the present invention. 

20 The instructions may be loaded into the memory of the server or client computers 

from a storage device or from one or more other computer systems over a network 
connection. For example, a client computer may transmit a sequence of instructions to 
the server computer in response to a message transmitted to the client over a network by 
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the server. As the server receives the instructions over the network connection, it stores 
the instructions in memory. The server may store the instructions for later execution, or 
it may execute the instructions as they arrive over the network connection. In some 
cases, the dovmloaded instructions may be directly supported by the CPU. In other cases, 

5 the instructions may not be directly executable by the CPU, and may instead be executed 
by an interpreter that interprets the instructions. In other embodiments, hardwired 
circuitry may be used in place of, or in combination with, software instructions to 
implement the present invention. Thus, the present invention is not limited to any 
specific combination of hardware circuitry and software, nor to any particular source for 

10 the instructions executed by the server or client computers. 

Figure 1 is a block diagram of a computer network system that can be used to 
implement a data transmission and conversion, according to one embodiment of the 
present invention. The system 100 of Figure 1 enables the transmission and conversion 
of content data. The term "content data" in the context of the specification and claims 

15 shall be understood to refer to any type of dovraloadable data, which may consist of any 
one of text data, video linear streaming data, such as motion picture data in MPEG or 
MPEG2 format; hnear audio streaming data, such as music data in MPS format; binary 
program data; voice data; or any combination of such data or similar data. In general, 
content data does not include services or data that are used solely to provide access to a 

20 network, such as browser software or protocol handlers whose main function is only to 
establish a network connection. 

Figure 1 illustrates a computer network system 100 that implements one or more 
embodiments of the present invention. In system 100, a network server computer 104 is 
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coupled, directly or indirectly, to one or more network client computers 102 through a 
network 110. The network interface between server computer 104 and client computer 
102 may also include one or more routers. The routers serve to buffer and route the data 
transmitted between the server and client computers. Network 110 may be the Intemet, a 

5 Wide Area Network (WAN), a Local Area Network (LAN), intranet, extranet, vweless 
network, or any combination thereof. 

In one embodiment of the present invention, the server computer 104 is a World- 
Wide Web (WWW) server that stores data in the form of 'web pages' and transmits these 
pages as Hypertext Markup Language (HTML) files over the Intemet network 1 10 to one 

10 or more of the client computers 102. For this embodiment, the client computer 102 runs 
a "web browser" program 1 14 to access the web pages served by server computer 104, 
Additional web based content can be provided to client computer 102 by separate content 
providers, such as supplemental server 103. 

In one embodiment of the present invention, server 104 in network system 100 

15 includes a download service management process 112 that is configured to handle 
download requests fi'om a user. Access to the server 104, which may comprise one of 
several servers, is facilitated typically through a router on network 110 which directs 
requests to the download management server. When the server 104 receives requests 
from a user, the server executes a download of requested content ftom a contents 

20 database that is stored internally or externally to the server. Along with processing 

requests for downloading of content data, the server 104 may also retrieve the requesting 
user's customer data firom a customer database and attach it to the requested primary 
contents or use it to modify content or transmission parameters for particular users. This 
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data is then transmitted via the network 1 10 by means of a known networking protocol 
standard, such as the file transfer protocol (ftp). 

In one embodiment of the present invention, wherein network 1 10 is the Intemet, 
network server 104 also executes a web server process 1 16 to provide HTML documents 

5 to client computers coupled to network 110. To access the HTML files provided by 
server 104, client computer 102 runs a web client process (typically a web browser) 1 14 
that accesses and provides links to web pages available on server 104 and other Internet 
server sites. It should be noted that a network system 100 that implements embodiments 
of the present invention may include a larger number of interconnected client and server 

10 computers than shown in Figure 1 . 

The network 1 10 is normally a bi-directional digital commxmications network that 
connects the user's terminal hardware with the download management server provided on 
the server side of the system. With current technologies, a CATV (cable television) bi- 
directional network, ISDN (Integrated Services Digital Network), DSL (Digital 

15 Subscriber Line), or xDSL high-speed networks are examples of existing network 
infrastructures enabling the necessary network connections for implementing 
embodiments of the present invention. 

The client computer of system 100 may comprise a personal computer that 
includes a modem or network adapter, or it may comprise a networked game console 

20 (entertainment system) that utilizes a detachable storage medium therein, and a TV 

monitor or any other suitable display device connected to the game console. The modem 
or network adapter is a device that is used to connect the client's terminal hardware, e.g., 
a game console, for cormection to the network 110. For example, if network 1 10 is a 



9 



CATV network, the modem may be implemented as a cable modem device; and if 
network 1 10 is an ISDN network, the modem may be implemented as a terminal adapter. 

The server can supply digital content such as voice data, music clips, full-length 
audio and video programs, movies, still picture data, and other similar types of content. 
5 The content might further comprise promotional or advertising data associated with the 
primary content, such as movie previews, demo games, sample data, and other similar 
types of content. 

In one embodiment, network system 100 includes a conversion system that 
transforms or processes the data transmitted from the server to the client to improve the 

10 user interface and quality of entertainment. For the embodiment in which the transmitted 
data comprises voice data, the conversion system can be used in various IP telephony, 
network chat, video game, or 3D virtual chat applications, among others. 

Figure 2 illustrates a block diagram of a network that includes a content 
conversion process, according to one embodiment of the present invention. For the 

15 embodiment illustrated in Figure 2, the data transmitted from the server comprises text 
data 201 generated by a server computer and transmitted to a client computer over a 
network 210. The text data is output converted into voice output through a digital-to- 
analog (D/A) converter 208 coupled to the cUent computer. The conversion system 202 
includes a conversion process 204 and a receiver preference database 206. In one 

20 embodiment, the conversion system 202 is resident within the client computer. 

Alternatively, the conversion system 202 can be included within a separate computer 
coupled to the network and the client computer. 

The conversion process 204 includes circuits or processes that convert the input 
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text data to output data, as well as processes that modify or transform the characteristics 
of the text data. For example, for voice output, the conversion process can be used to 
control various characteristics such as, tone, accents, intonation, and effects, such as 
echo, reverberation, and so on. For speech output, the conversion process can control 
5 characteristics such as language, dialect, expression, and so on. For example, the 

conversion process 204 may include a translator that translates speech in one language to 
another language. The conversion process can also include processes that mimic the 
voice characteristics of well-known characters or personalities. 

Figure 3 illustrates a block diagram of a network that includes a content 
O 10 conversion process for content data that comprises voice data, according to one 
'fzf embodiment of the present invention. For the embodiment illustrated in Figure 3, the 

data transmitted from the server comprises voice data generated by a server computer and 
J n transmitted to a client computer over a network 310. The voice data 302 is first input 

p through an analog-to-digital (A/D) converter 302 for conversion into digital form. The 

W 15 voice packets can be addressed in one of several ways, including Unicast, Multicast, or 
p broadcast format. 

Alternatively, if the voice data includes data that is first input into the server 
computer, the data can be digitized prior to input to the server computer. For example, a 
microphone or other input means may include an A/D converter to convert the voice data 
20 to digital form prior to input to the server computer. The digitized voice data is then 
transmitted over network 3 10 for further processing by voice conversion means 312. 

The voice of the transmitted data can be changed and sent to other assigned 
user(s) over the network using a protocol such as Voice over IP (VoIP). The voice can be 



11 



changed based on various factors such as virtual character talk parameters, or user 
provided preferences. The digitized voice data is transformed into output voice data at 
the client computer through digital-to-analog (D/A) converter 304. The digitized voice 
data output from A/D converter 302 is processed through conversion system 312. The 

5 conversion system 312 includes a voice conversion process 314 and a conversion rules 
database 316. Altematively, the digitized voice data can be converted to analog form 
after output from the client computer through an external D/A converter. Such a D/A 
converter may be incorporated into speaker, headphone, or other sound output systems 
that receive digital audio output from the client computer. 

10 The voice conversion process 314 comprises processes that alter or modify the 

digitized voice data output from A/D converter 302 in the server computer into converted 
voice data to be output from D/A converter 304 on the client computer. Figure 4 
illustrates the basic flow of data through the voice conversion process illustrated in 
Figure 3, according to one embodiment of the present invention. In flow diagram 400, 

15 audio data 402 represents the digitized voice data that is output from A/D converter after 
input into the server computer through an input device, such as a microphone. The 
digitized audio data 402 is converted into text data 404 through a voice recognition 
process that converts digitized audio data to equivalent digital text data. The text data 
404 is then processed by a text conversion process 414 to produce converted text data 

20 406. This converted text data 406 is then processed through a voice synthesis process 
416 to produce audio data 408. The audio data 408 comprises digital audio data that is 
input to D/A converter 304 for conversion to analog voice to be output through speakers 
on the client computer. 
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The text conversion process 414 includes several sub-processes that alter the 
original voice data to change the voice as it is played back on the cUent computer. Such 
changes can include modification of the original voice tone, accent, intonation, and so on. 
The text conversion process can also include processes that alter the substance of the 

5 input data, such as language translation (e.g., English-French) or dialect translation. 
Primarily, the text conversion process alters the expression of the original voice data. 
The expression shows a character's personality or attribute (e.g., male or female or child 
speaker), character's circumstance or environment (e.g., in a tunnel, cave, etc.), the 
character's condition (e.g., excited, sad, injured, etc.). The text conversion process can 

10 also include special effects that alter the input voice data, such as Doppler effect, echo, 
and so on. 

In one embodiment of the present invention, the characteristics that dictate how 
the voice data is converted are provided by a conversion rules process 316. The rules 
process 316 specifies several parameters used by the voice conversion process 314 that 

15 are used to alter the input voice data. The voice rules process 316 includes user provided 
character profiles. In one embodiment, the character profiles are entered by the user 
through a user interface provided on the client computer. 

The character profile can be used to tailor the voice that a displayed character 
speaks with in applications such as video games, educational programs, interactive 

20 applications, text-to-speech programs, and the like. The character talking voice is 

determined by fimdamental parameters, such as frequency, waveform, etc.). The voice 
conversion process shapes the basic waveform to produce a converted voice that 
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corresponds to the selected character profile. In one embodiment, a user can set the 
profile for the character. 

Figure 5 illustrates a graphical user screen that illustrates a character profile input 
display. The character profile set up display window 500 includes several user selectable 
input fields that the user can change to alter the characteristics of the voice output. The 
user first selects the gender of the character that will recite the playback voice. As 
shown, the user can select a man's voice or a woman's voice. Other voice type 
characteristics can also be provided, such as child or baby. Various voice characteristics 
are also provided, such as age, sociability, activity, intelligence, and masculinity. Each of 
these characteristics shapes the voice playback parameters. For example, choosing an 
older age or increasing the masculinity generally lowers the voice. The sociability, 
activity, and intelligence characteristics generally affect how active and articulate the 
playback voice is portrayed. 

For the embodiment illustrated in Figure 5, the user characteristics are displayed 
as bar sUdes that the user can move through an input device, such as a mouse, to select a 
relative value for the respective characteristic. It should be noted that various other input 
methods could be provided, such as numerical value entries, percentage value entries, and 
the like. 

In an alternative embodiment, the character's talking voice can be created based 
on each pre-set characters profile. For this embodiment, the rules process 316 can 
include a user specified database that stores certain parameters or data entries for various 
variables of the voice data. For example, database parameters could include values that 
dictate the gender of the output voice, language, expression, and so on. Through the use 



14 



of such a database, the voice data output on the client computer could, for example, be set 
to speak in English in a male voice with an English accent. 

In one embodiment of the present invention, the voice conversion process is 
implemented in a distributed interactive game system comprising a plurality of 
networked games coupled among two or more users. Figure 6 illustrates a networked 
game environment in which user game consoles communicate over a network, according 
to one embodiment of the present invention. A first user game console 605 is coupled to 
network 608 through a cable modem 606. For this embodiment, network 608 is typically 
a cable TV (CATV) network. Also coupled to game console 605 is a speaker pair 604 for 
voice output, and a microphone 602 for voice input. A second user game console 607 is 
coupled to network 608 through a cable modem 612. A microphone 614 and speaker pair 
616 is coupled to the game console 607. 

In system 600, a server computer 610 may be coupled to network 608. The server 
computer can perform a variety of functions, such as game monitoring, providing game 
or application programs, managing user accounts, and the like. 

Figure 7 illustrates a networked game environment in which user game consoles 
communicate over a network, according to one embodiment of the present invention. 
For system 700, the network 708 comprises the Intemet, and the first game console 705 is 
coupled to the second game console 707 through Voice over IP gateways 706 and 712. 
Each game console is attached to a speaker 704, 716, and microphone 702 and 714 set, 
respectively. 

For the embodiments illustrated in Figure 6 and 7, the output voice characteristic 
depends upon user information. In this maimer, each user participant (player) can have a 
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different voice assigned to his character or terminal It is assumed that each user controls 
a character that is displayed on the terminal of each game console. The characteristics of 
the character's voice can then be determined based on the location of the user to whom 
the character belongs. For example, assuming each game console has a left and right pair 
of speakers, the output voice volxime ratio of the speaker pair is set based on the direction 
of the sender location. This provides some spatial effect of the voice relative to the 
location of the speaking character. The volume can also be changed based on the 
distance between the sender and the receiver. Alternatively, when a plurality of users is 
commxmicating with one another, each user's voice is assigned to each speaker based on 
their location. 

The user location determination process is included in the voice conversion 
process as a means of altering the voice of a character played back on a user game 
console. In this process, the direction or/and distance between the sender and the receiver 
is calculated and the volume ratio of the left-right speaker pair is set based on the 
calculated data. In the case of surround-sound environment in which multiple speakers 
are coupled to a console, the other speakers are also considered. 

In one embodiment, user location information for a plurality of networked game 
players is determined by using address information for each of the players. Address 
information can be stored in a database provided in each game console. 

The address or location information may be provided by using the telephone 
number for each player. In this case, the area code provides a rough approximation of a 
user's location relative to the other users. An address database related to telephone 
numbers is stored in the memory of each terminal. A particular user's terminal receives a 
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sender's telephone nximber and retrieves the location based on the telephone nxmiber. 
Using the retrieved location data and the user's own location data, the receiver terminal 
calculates the direction or/and distance. 

In an alternative embodiment, the location information can be provided using a 
personal database stored in each game console memory (e.g., secondary memory). For 
this case, each user has to input the other user's addresses in advance. Zip code 
information could be used to provide reasonable approximations of user locations. The 
information is stored in a memory location of the game console. When a connection 
between users is estabUshed, ID information (e.g., user ID, telephone No., etc.) is sent to 
each user. Using the ID information, the user location is retrieved in each personal 
database and the direction and/or distance is calculated based on the user location. 

Instead of storing user location information in each game console, the address 
information for a group of networked users can be stored in a central server, such as 
server 610 in Figure 6. For this embodiment, the server stores the addresses or location 
information (zip code, area code, etc.) for all of the users in a database. The direction 
and/or the distance are calculated based on the stored user information in the server. The 
server sends each user direction and/or distance information for the other users. Each 
individual user terminal then sets the volume ratio or whole volume based on the location 
mformation. For this embodiment, voice data is sent to each user through the server. 

It should be noted that the process of altering the data in accordance with output 
voice characteristics can be implemented either in the server (data sending) computer, the 
client (data receiving) computer, or a network server computer coupled to the server and 
client computer. Each computer capable of altering the transmitted data would have 
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associated with it a voice or text conversion means, such as that illustrated in Figure 4. 
Such a conversion means could be implemented in hardware circuitry coupled to the 
computer, a software program executed by the computer, or a combination of dedicated 
hardware and software processes. Moreover, the database storing the various voice 

5 characteristics for each associated client computer or character within a client computer 
can be stored locally in each client computer or centrally in a database accessible to a 
network server computer. 

Depending upon where the output alteration process is performed, the steps of 
transmitting, altering, and receiving the data can be done in various different step 

10 sequences. For example, the data can be first transmitted from the server computer, 
altered in the server or other computer, and then received by the client computer. If the 
alteration process is performed by the client computer, the process can be performed by 
first transmitting the data from the server computer to the client computer, receiving the 
data in the cUent computer, and then altering the data in accordance with the specified 

1 5 . output characteristics. 

Besides game or entertainment programs, the voice conversion process described 
in relation to Figures 6 and 7 can be used in various other applications involving speech 
content transmitted among a plurality of users. Examples include chat room applications, 
Internet telephony, and other similar appUcations. 

20 In the foregoing, a system has been described for modifying transmitted content 

data based on user preferences. Although the present invention has been described v^th 
reference to specific exemplary embodiments, it will be evident that various 
modifications and changes may be made to these embodiments without departing from 
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the broader spirit and scope of the invention as set forth in the claims. Accordingly, the 
specification and drawings are to be regarded in an illustrative rather than a restrictive 
sense. 
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